i 



15 



"Express Mail" Mailing Label No. EL739929949US 

PATENT APPLICATION 
ATTORNEY DOCKET NO. SUN-P5569-RJL 



1 0 APPARATUS AND METHOD FOR ERROR 
DETECTION ON SOURCE-SYNCHRONOUS 

BUSES 



Inventor: Satyanarayana Nishtala 



BACKGROUND 

20 Field of the Invention 

The present invention relates to detecting communications errors between 
functional units in a computer system. More specifically, the present invention 
relates to an apparatus and a method for detecting errors on a source-synchronous 
bus within a computer system. 

25 

Related Art 

It is essential for the various functional units of a computing system to 
communicate with each other in order for the computing system to perform its 
assigned tasks. Traditionally, these functional units, which include the central 
30 processing unit, memory, I/O devices, and the like, are coupled together by a bus 
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structure. When a first functional unit needs to communicate with a second 
functional unit, the fost functional unit typically requests access to the bus from a 
bus master. The bus master then grants the first functional unit exclusive access 
to the bus for a bus transaction. During the transaction, the bus is not available to 

5 the other functional units. 

In older, slower computing systems, a global clock signal is distributed to 
each of the functional units. Typically, within each function unit, the global clock 
signal is regenerated using circuitry such as a phase-lock loop. Regenerating this 
global clock signal removes any noise on the clock signal, thereby virtually 

1 0 eliminating errors in data transferred between functional units attributable to the 
clock signal. 

More modem computing systems, however, operate at higher clock rates, 
which causes problems when a global clock signal is used. Since the global clock 
signal arrives at a functional unit by a different route than the data signals, the 
1 5 clock signal can be offset in time from the data signals. At these higher clock 

rates, the margins for error are much smaller than the margins at lower clock rates. 

In an effort to alleviate the problems associated with global clock signals, 
designers have developed source-synchronous buses. A source-synchronous bus 
differs from older bus systems in that the clock signal is routed along with the 
20 other data signals between the source and the destination. Care is taken to provide 
the same path lengths and environment for the clock as for the data. Source- 
synchronous buses, therefore, essentially eliminate any offset in time between the 
clock signal and the associated data signals. 

However, using a source-synchronous bus can cause problems. The clock 
25 signal is not regenerated in the source-synchronous bus as it is for the global clock 
signal used in previous structures, but instead, the clock signal is used as received. 
Since the clock signal used with a source-synchronous bus is subject to the same 
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environment as the data signals, the clock signals are subject to the same kind of 
errors as are found on the data signals. 

However, errors on the clock signals are more catastrophic than similar 
errors on the data signals. This is so because a faulty clock signal affects all of the 
5 data signals whereas a faulty data signal affects only the one data signal. This 
makes error detection on a source-synchronous bus difficult. For example, a 
parity bit can be used to detect a single bit error. A clock error, however, affects 
multiple bits and may not be detected by a parity bit. 

Furthermore, an error correcting code typically corrects a single bit error 
] 0 and detects most multiple bit errors. However, if the clock signal is faulty, it is 
probable that more than one data bit would be affected, thereby negating the 
advantage of using the error correcting code. 

What is needed is an apparatus and a method for detecting errors on a 
source-synchronous bus while maintaining the high throughput associated with a 
1 5 source-synchronous bus. 



SUMMARY 

One embodiment of the present invention provides a system for detecting 
errors on a source-synchronous bus. The source-synchronous bus includes a 

20 plurality of data lines and a clock line. A transmitting mechanism configured to 
transmit data on the source-synchronous bus is coupled to the source-synchronous 
bus. A receiving mechanism configured to receive data from the source- 
synchronous bus is also coupled to the source-synchronous bus. An error 
detecting mechanism configured to detect errors on the source-synchronous bus is 

25 coupled to the receiving mechanism. The error detecting mechanism can detect 
errors on the plurality of data lines including errors that are caused by an error on 
the clock line. 
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In one embodiment of the present invention, the system includes a 
grouping mechanism coupled to the transmitting mechanism that is configured to 
group data bits into an error group. The system also includes a detection code 
generating mechanism coupled to the grouping mechanism that is configured to 
generate a detection code for the error group. The transmitting mechanism is 
further configured to transmit the detection code on the source-synchronous bus 
using a clock cycle other than the clock cycle used for the error group. 

In one embodiment of the present invention, the detection code is a parity 

bit. 

In one embodiment of the present invention, the detection code is an error 
correcting code. 

In one embodiment of the present invention, the grouping mechanism is 
configured to skew data bits within the error group across time. 

In one embodiment of the present invention, skewing data bits across time 
includes delaying each data bit based on the position of the data bit within the 
error group. 

In one embodiment of the present invention, the system provides a 
gathering mechanism coupled to the receiving mechanism that is configured to de- 
skew the data bits within the error group. 

BRIEF DESCRIPTION OF THE FIGURES 

FIG. 1 illustrates computing elements coupled together by source- 
synchronous bus 106 in accordance with an embodiment of the present invention. 

FIG. 2 illustrates details of detection code generator 1 16 in accordance 
with an embodiment of the present invention. 

FIG. 3 A illustrates details of data bit grouper 1 12 in accordance with an 
embodiment of the present invention. 

4 

Attorney Docket No. SUN-P5569-RJL Inventor: Nishtala 

EJGC.VMY DOCUMENTS\SUN MICROSYSTEMS\SUN-P5569-RJL\SUN-P5569-RJL APPLICATION DOC 



FIG. 3B illustrates a group of coded data 206 in accordance with an 
embodiment of the present invention. 

FIG. 3C illustrates skewed data 306 in accordance with an embodiment of 
the present invention, 
5 FIG. 4 illustrates details of data bit gatherer 11 4 in accordance with an 

embodiment of the present invention. 

FIG. 5 illustrates details of receive error detector 1 18 in accordance with 
an embodiment of the present invention. 

FIG. 6 is a flowchart illustrating the process of sending data on source- 
10 synchronous bus 106 in accordance with an embodiment of the present invention. 

FIG. 7 is a flowchart illustrating the process of receiving data from source- 
synchronous bus 106 in accordance with an embodiment of the present invention. 



DETAILED DESCRIPTION 

1 5 The following description is presented to enable any person skilled in the 

art to make and use the invention, and is provided in the context of a particular 
application and its requirements. Various modifications to the disclosed 
embodiments will be readily apparent to those skilled in the art, and the general 
principles defined herein may be applied to other embodiments and applications 

20 without departing from the spirit and scope of the present invention. Thus, the 
present invention is not intended to be limited to the embodiments shown, but is 
to be accorded the widest scope consistent with the principles and features 
disclosed herein. 



25 Computing Elements 

FIG. 1 illustrates computing elements coupled together by source- 
synchronous bus 106 in accordance with an embodiment of the present invention. 
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Source 102 and destination 104 are coupled together by source-synchronous bus 
106. Source 102 can include any source of data transactions within a computing 
system. For example, source 102 can include a central processing unit. 
Destination 104 can include any destination of data transactions within a 
5 computing system. For example, destination 104 can include a memory 
subsystem. 

Source 102 includes data transmitter 108, data bit grouper 1 12 and 
detection code generator 116. Detection code generator 116 generates error 
detection codes for data to be transmitted on source-synchronous bus 106. The 
10 operation of detection code generator 1 1 6 is discussed below in conjunction with 
FIG. 2. 

Data bit grouper 112 groups the data bits into an error group. After the 
data bits are grouped into an error group, the bits are skewed across time. Details 
of data bit grouper 1 12 are discussed in conjunction with FIG. 3 A. 

1 5 Data transmitter 1 08 receives the skewed data from data bit grouper 1 12 

and sends the skewed data to destination 104 across source-synchronous bus 106. 

Destination 104 includes data receiver 110, data bit gatherer 1 14, and 
receive error detector 118. Data receiver 110 receives the skewed data sent by 
source 102 across source-synchronous bus 106 and passes the received data to 

20 data bit gatherer 114. Data bit gatherer 1 14 de-skews the received data. Details 
of the operation of data bit gatherer 1 14 is discussed below in conjunction with 
FIG, 4, 

Receive error detector 1 1 8 receives the coded data from data bit gatherer 
1 14. Receive error detector 118 detects errors in the data as described below in 
25 conjunction with FIG. 5. If receive error detector 118 detects no errors, the data 
becomes available for use by destination 104. 
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Detection Code Generator 

FIG. 2 illustrates details of detection code generator 1 16 in accordance 
with an embodiment of the present invention. Detection code generator 1 16 
includes parity generator 204, Detection code generator 116 accepts incoming 
5 data 202 from source 102. In FIG. 2, data 202 includes eight data bits, however, 
there is nothing inherent in the present invention that restricts the incoming data to 
eight bits. A person of ordinary skill in the art will be able to extend the ideas of 
this invention to include any number of data bits. 

Parity generator 204 generates a parity bit for data 202. This parity bit can 
10 be configured as odd parity or even parity as desired. The output of parity 
generator 204 is supplied by detection code generator 1 16 as coded data 206. 

A person of ordinary skill in the art will be able to replace parity generator 
204 within detection code generator 116 with any other detection code generator 
or by an error correcting code generator. The number of bits added to data 202 by 
1 5 detection code generator 1 16 to make coded data 206 can be more than one bit. 

Data Bit Grouping 

FIG. 3 A illustrates details of data bit grouper 1 12 in accordance with an 
embodiment of the present invention. Data bit grouper 1 12 includes grouping 

20 mechanism 302 and skewing mechanism 304. Grouping mechanism 302 groups 
the incoming data into one or more words, where a word is the number of bits in 
coded data 206. In this example, the number of bits in a word is nine. The 
number of words that grouping mechanism 302 includes in a group can be 
determined empirically by the designer to protect data transmissions from errors 

25 without adversely affecting detection latency. A group of data words is described 
in more detail in conjunction with FIG. 3B 
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Optionally, data bit grouper 112 can include a mechanism, such as a 
longitudinal parity mechanism, for checking the data bits on a single line. Such 
longitudinal parity bits would be appended to the group as an additional data 
word. 

5 Skewing mechanism 304 provides skewed data 306 to data transmitter 108 

for transmission on source-synchronous bus 106. Skewing mechanism 304 skews 
the bits of the data group across time so that for each clock signal, the data word 
of skewed data 306 sent during that clock signal includes bits from several words 
of coded data 206. Skewing mechanism 304 operates by inserting a different 

10 number of delay cycles into each data line. Details of skewed data 306 and the 
delay mechanisms in skewing mechanism 304 are discussed in greater detail in 
conjunction with FIG. 3C below. 

FIG. 3B illustrates a group of coded data 206 in accordance with an 
embodiment of the present invention. Grouping mechanism 302 combines a 

1 5 quantity of words, designated by the subscript n, of coded data 206 into a group as 
shown. Optionally, the last word of a group can be a longitudinal parity word, or 
other group error word as discussed above. 

The bits within a group are numbered by a subscript, such as 27 in d27. 
The two indicates the data word within the group, while the seven indicates the bit 

20 within the word. The subscript on the parity bit, such as the four in p4 indicates 
that this is the parity bit for word four. 

FIG, 3C illustrates skewed data 306 in accordance with an embodiment of 
the present invention. Skewing mechanism 304 includes delays in the paths of the 
various bits to delay the bits by a different number of clock periods for each bit 

25 hne. As illustrated in FIG. 3C, the parity bit is not delayed, data bit seven is delay 
by one clock cycle, data bit six is delayed by two clock cycles, and so on until data 
bit zero is delayed for eight clock cycles. 
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By skewing the data as shown in FIG. 3, bits from several different data 
words are transmitted during each clock cycle. For example, during the time that 
bit zero from word zero, doo, is being transmitted, bit one from word one, bit two 
from word two, bit three from word three, bit four from word four, bit five from 
5 word five, bit six from word six, bit seven from word seven, and the parity bit 
from word eight are being transmitted. An error on the clock signal during this bit 
time could affect several of these bits simultaneously. However, after the bits are 
de-skewed, each bit will be back in its own data word, and any data words with 
erroneous bits will be detected because the erroneous bits will no longer be in the 
10 same data word. 

Data Bit Gatherer 

FIG. 4 illustrates details of data bit gatherer 1 14 in accordance with an 
embodiment of the present invention. Data bit gatherer 1 14 receives skewed data 
15 400 from data receiver 1 10 within destination 104. Skewed data 400 is the 
received data from source-synchronous bus 106 and includes skewed data 306 
modified by any errors in skewed data 306 while in transit on source-synchronous 
bus 106. 

Data bit gatherer 114 includes de-skewing mechanism 402. De-skewing 
20 mechanism 402 is similar in construction to skewing mechanism 304. However, 
in de-skewing mechanism 402, data bit zero is not delayed, data bit one is delayed 
one clock time, and so on. This provides a total delay of eight clock cycles for 
each data line and for the parity line, which results in coded data 404. 

Coded data 404 is coded data 206 plus any errors that were introduced on 
25 the data while in transit across source-synchronous bus 106. Coded data 404 is 
coupled to receive error detector 1 1 8 so that errors in coded data 404 can be 
detected, and optionally, corrected. 
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Receive Error Detection 

FIG. 5 illustrates details of receive error detector 1 18 in accordance with 
an embodiment of the present invention. Coded data 404 is supplied to receive 

5 error detector 118. Receive error detector 1 1 8 includes error detecting mechanism 
502. Error detecting mechanism 502 is the complement of parity generator 204 
within detection code generator 1 16. Error detection mechanism 502 examines 
coded data 404 to determine if any bits are in error. Remember, however, that 
there should be no more than one bit in error even if the error was caused by an 

1 0 error in the clock signal. 

Optionally, error detecting mechanism 502 can be replaced with forms of 
error detection other than parity checking, and can include an error correcting 
mechanism. Error detecting mechanism 502 can also be configured to check for 
errors using a longitudinal parity if one is generated within source 102. 

1 5 If error detecting mechanism 5 02 does not detect any errors, recovered 

data 504 is supplied to destination 104. 

Sending Data on the Bus 

FIG. 6 is a flowchart illustrating the process of sending data on source- 
20 synchronous bus 1 06 in accordance with an embodiment of the present invention. 
The system starts when detection code generator 116 receives data 202 from 
source 102 to be sent to destination 104 across source-synchronous bus 106 (step 
602). 

Parity generator 204 within detection code generator 116 generates the 
25 detection code for data 202 to create coded data 206 (step 604). Coded data 206 
is then sent to data bit grouper 1 12 where grouping mechanism 302 groups coded 
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data 206 into error groups (step 606). Next, skewing mechanism 304 skews the 
data to create skewed data 306 (step 608). 

Finally, data transmitter 108 sends skewed data 306 across source- 
synchronous bus 106 to destination 104 (step 610). 

5 

Receiving Data from the Bus 

FIG. 7 is a flowchart illustrating the process of receiving data from source- 
synchronous bus 106 in accordance with an embodiment of the present invention. 
The system starts when data receiver 110 within destination 104 receives skewed 

10 data 400 from source 102 across source-synchronous bus 106 (step 702). Note 
that skewed data 400 is skewed data 306 plus any errors generated during 
transmission across source-synchronous bus 106. 

Data receiver 1 1 0 passes skewed data 400 to data bit gatherer 114. De- 
skewing mechanism 402 within data bit gatherer 1 14 de-skews the data to recreate 

1 5 the error groups for error detecting mechanism 502 (step 704). Data bit gatherer 
1 14 then passes coded data 404 to receive error detector 118. 

Error detecting mechanism 502 within receive error detector 118 next 
determines if there are any errors in coded data 404 (step 706). If there are no 
errors in coded data 404 (step 708), processing continues at 710. Recovered data 

20 504 is passed to destination 104 for use (step 710). If there are errors in coded 

data 404, receive error detector 118 reports the error to destination 104 (step 712). 

The foregoing descriptions of embodiments of the present invention have 
been presented for purposes of illustration and description only. They are not 
intended to be exhaustive or to limit the present invention to the forms disclosed. 

25 Accordingly, many modifications and variations will be apparent to practitioners 
skilled in the art. Additionally, the above disclosure is not intended to limit the 
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present invention. The scope of the present invention is defined by the appended 
claims. 
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